Phoneme recognition based on hybrid neural networks with inhibition/enhancement of distinctive phonetic feature (DPF) trajectories
نویسندگان
چکیده
In this paper, we introduce a novel distinctive phonetic feature (DPF) extraction method that incorporates inhibition/enhancement functionalities by discriminating the DPF dynamic patterns of trajectories relevant or not. The trajectories of each DPF show a convex pattern when the DPF is relevant and a concave one when irrelevant. The proposed algorithm enhances convex type patterns and inhibits concave type patterns. We implement the algorithm into a phoneme recognizer and evaluate it. The recognizer consists of two stages. The first stage extracts 45 dimensional DPF vectors from local features (LFs) of input speech using a hybrid neural network and incorporates an inhibition/enhancement network to obtain modified DPF patterns, and the second stage orthogonalizes the DPF vectors and then feeds them to an HMM-based classifier. The proposed phoneme recognizer significantly improves the phoneme recognition accuracy with fewer mixture components by resolving coarticulation effects.
منابع مشابه
Inhibition/Enhancement Network Based ASR using Multiple DPF Extractors
— This paper describes an evaluation of Inhibition/Enhancement (In/En) network for robust automatic speech recognition (ASR). In distinctive phonetic features (DPFs) based speech recognition using neural network, In/En network is needed to discriminate whether the DPFs dynamic patterns of trajectories are convex or concave. The network is used to achieve categorical DPFs movement by enhancing ...
متن کاملEffects of Syllable Language Model on Distinctive Phonetic Features (DPFs) based Phoneme Recognition Performance
This paper presents a distinctive phonetic features (DPFs) based phoneme recognition method by incorporating syllable language models (LMs). The method comprises three stages. The first stage extracts three DPF vectors of 15 dimensions each from local features (LFs) of an input speech signal using three multilayer neural networks (MLNs). The second stage incorporates an Inhibition/Enhancement (...
متن کاملDistinctive phonetic feature (DPF) based phone segmentation using hybrid neural networks
Segmentation of speech into its corresponding phones has become very important issue in many speech processing areas such as speech recognition, speech analysis, speech synthesis, and speech database. In this paper, for accurate segmentation in speech recognition applications, we introduce Distinctive Phonetic Feature (DPF) based feature extraction using a twostage NN (Neural Networks) system c...
متن کاملSelected Papers of the IEEE International Conference on Computer and Information Technology
This paper presents a distinctive phonetic features (DPFs) based phoneme recognition method by incorporating syllable language models (LMs). The method comprises three stages. The first stage extracts three DPF vectors of 15 dimensions each from local features (LFs) of an input speech signal using three multilayer neural networks (MLNs). The second stage incorporates an Inhibition/Enhancement (...
متن کاملSelected Papers of the Thirteenth International Conference on Computer and
— This paper describes an evaluation of Inhibition/Enhancement (In/En) network for robust automatic speech recognition (ASR). In distinctive phonetic features (DPFs) based speech recognition using neural network, In/En network is needed to discriminate whether the DPFs dynamic patterns of trajectories are convex or concave. The network is used to achieve categorical DPFs movement by enhancing ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008